METHOD AND APPARATUS FOR GENERATING 
THREE-DIMENSIONAL DATA 



[0001] This application is based on Japanese Patent 
Application No. 2000-243860 filed on August 11, 2000, the 
contents of which are hereby incorporated by reference. 

BACKGROUND OF THE INVENTION 

1. Field of the Invention 

[0002] The present invention relates to a method and an 
apparatus for generating three-dimensional data by 
detecting corresponding points of two two-dimensional 
images whose contents are similar to each other such as 
two two-dimensional images captured stereographically with 
respect to an object using a gradient method, a 
correlation method or the like. 

2. Description of the Prior Art 

[0003] Conventionally, there are known techniques for 
generating three-dimensional data from plural two- 
dimensional image data (a two-dimensional image is 
hereinafter referred to as an "image" and two-dimensional 
image data are referred to as "image data" in the present 
specification) as of an object captured by, for example, 
using a plurality of cameras. In generating the three- 
dimensional data, it is necessary to detect to which point 
of an image an arbitrary point of another image 
corresponds to, i.e. to detect corresponding points of the 
images. The correlation method is the most popular method 
for detecting such corresponding points . 
[0004] By the correlation method, the corresponding 



points are detected by setting an image as a standard 
image and finding an area similar to the standard image 
from another image. For example, the corresponding 
points are detected in such a manner by extracting a small 
area of the standard image as a standard window and 
detecting, in another image, a window having luminance of 
points similar to those of points of the standard window. 
The correlation method has other applications such as a 
method for detecting the corresponding points by a 
correlation of luminance on an epipolar line, a method 
wherein the size of a window is changed and a method 
wherein a correlation is detected by binarization of an 
image . 

[0005] Another known method for detecting the 
corresponding points is the gradient method. In the 
gradient method, the corresponding points are detected 
from luminance gradient of an area adjacent to a point 
whose coordinate is in common to plural images similar to 
each other. 

[0006] The correlation method is superior in detecting 
corresponding points of an object whose luminance or color 
gradation is widely varying on its surface. However, any 
of the above methods are not fully satisfactory for the 
detection of the corresponding points of an object that is 
poor in luminance change or an object that is low in 
contrast. Therefore, the correlation method is not 
suitable for generating three-dimensional data of a face 
of a human or the like. 

[0007] In turn, the gradient method requires shorter 
calculation time as compared with the correlation method 
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and is superior in detecting the corresponding points in 
the case of low luminance and low contrast. Further, the 
gradient method can detect the corresponding points 
correctly by using subpixels if the object is in an ideal 
state. However, the gradient method is not competent to 
differences in luminance or color gradation or CCD noise 
and has a drawback that an erroneous detection of the 
corresponding points occurs when there is an occlusion. 
Therefore, the gradient method is not suitable for 
generating three-dimensional data of products having a 
spherical or cylindrical shape, products made from a metal 
or glass and so on. 

SUMMARY OF THE INVENTION 
[0008] An object of the present invention is to 
generate highly reliable three-dimensional data of any 
types of objects such as a human face, an industrial 
product and so on. 

[0009] The three-dimensional data generating apparatus 
according to the present invention comprises: an obtaining 
portion for obtaining plural image data representing 
images obtained by photographing an object from different 
viewpoints, the plural image data each representing 
respective one of the images; a first detector for 
detecting a corresponding point in each of the images by 
the gradient method; a second detector for detecting a 
corresponding point in each of the images by the 
correlation method; a selector for selecting either one of 
the first detector or the second detector; and a generator 
for generating three-dimensional image of the object based 



on the corresponding point detect by the selected detector. 
[0010] Preferably, the apparatus may comprise an object 
judging portion for judging a type of an object, and the 
selector may select either one of the first detector or 
the second detector based on a judgment result obtained by 
the object judging portion. 

[0011] The obtaining portion may be comprised in the 
apparatus separately from other portions or integrally 
with other portions . 

BRIEF DESCRIPTION OF THE DRAWINGS 
[0012] Fig. 1 shows an appearance of a three- 
dimensional image generating apparatus according to a 
first embodiment of the present invention. 

[0013] Fig. 2 is a block diagram showing a structure of 
the three-dimensional data generating apparatus according 
to the first embodiment. 

[0014] Figs. 3 is an example of two images. 

[0015] Fig. 4 is a graph showing an expression of a 

luminance gradient . 

[0016] Fig. 5 is an enlarged view of windows WA and WB. 

[0017] Fig. 6 shows a noted pixels GA and GB as well as 

pixels in the vicinity of the noted pixels. 

[0018] Fig. 7 illustrates a method wherein a multiple 

resolution image is used by way of an example. 

[0019] Fig. 8 shows a concept of detection using an 

epipolar line EL. 

[0020] Fig. 9 illustrates a relationship between a 
correlation value and difference. 

[0021] Fig. 10 is a flowchart showing a flow of the 



generation of three-dimensional data according to the 
f ir s t embodiment . 

[0022] Fig. 11 is a block diagram showing a three- 
dimensional data generating apparatus according to a 
second embodiment of the present invention. 
[0023] Fig. 12 is a flowchart showing processing for 
generating three-dimensional data according to the second 
embodiment . 

[0024] Fig. 13 shows an appearance of a three- 
dimensional data generating apparatus IC according to the 
present invention . 

[0025] Fig. 14 is a block diagram showing constitution 
of the three-dimensional data generating apparatus IC 
according to the third embodiment. 

[0026] Fig. 15 illustrates the method of obtaining 
reliability of the detection results by way of an example. 
[0027] Fig. 16 is a flowchart showing a flow of 
processing for obtaining the reliability of the detection 
results. 

[0028] Fig. 17 is a flowchart showing a flow of 
processing for generating the three-dimensional data 
according to a third embodiment. 

[0029] Fig. 18 shows a three-dimensional data 
generating apparatus as a modification. 



DESCRIPTION OF THE PREFERRED EMBODIMENTS 
(First Embodiment) 

[0030] In the first embodiment, colors of an object are 
detected to select a detector for generating three- 
dimensional data. 



[0031] Fig. 1 shows an appearance of the three- 
dimensional data generating apparatus 1 according to the 
first embodiment. Fig. 2 is a block diagram showing a 
structure of the three-dimensional data generating 
apparatus 1 according to the first embodiment. 
[0032] As shown in Figs. 1 and 2, the three-dimensional 
data generating apparatus 1 comprises two image input 
portions 21, processor 3A and a detector appointing 
portion 4 . 

[0033] Referring to Fig. 2, the image input portions 21 
each comprises an optical system 12p, an imaging element 
12q, a camera 12 including a control circuit not shown and 
so on. The image input portions 21 serves to generate 
image data. The optical system 12p comprises various 
lenses, a diaphragm and so on and serves to form an image 
on the imaging element 12q by receiving light from an 
object Q. The imaging element 12q converts the image of 
the object Q into an electric signal. As the imaging 
element 12q, there may be used a CCD type solid-state 
imaging element and the like. The image signal generated 
by the imaging element 12q is converted into a digital 
signal by an A/D converter so as to be image data. 
[0034] The detector appointing portion 4 is a sliding 
switch for appointing either one of detectors to be 
described later in this specification in accordance with 
manual operation of a user. This sliding switch can be 
switched among 3 modes of "portrait", "auto" and 
"landscape" . 

[0035] The processor 3A comprises various devices such 
as a display 11, an input device, a CPU, a RAM, a ROM, a 



control circuit, software such as a control program and 
the like. By above devices and so on, there can be 
realized functions of two image data memories 31, an 
object judging portion 32S, a detector selecting portion 
33S, a first corresponding point detector 34, a second 
corresponding point detector 35, a three-dimensional data 
generator 36, a three-dimensional data memory 37, an image 
display 38 and so on, 

[0036] The image data memories 31 each memorize the 
image data generated by the image input portion. 
[0037] The object judging portion 32S judges a type of 
the object Q by detecting colors of the object Q and 
examining whether or not the object Q comprises a 
predetermined color in a predetermined amount . 
[0038] Amounts of colors to be comprised in the object 
Q are decided by determining an average of color 
information of pixels comprised in the image data obtained 
by the image input portions 21. Characteristics of the 
object Q appear in the vicinity of center of an image and, 
therefore, only the amounts of colors in the vicinity of 
the center of the object Q may be calculated for the 
judgment of the type of the object Q. In order to judge a 
type of the object Q, the image data may be low in 
resolution since they are used only for the detection of 
colors and the calculation of the amounts of colors. The 
color information is typically obtained by a color 
equation c(C) of the following expression (1): 

c(C) = r(R) + g(G) + b(B) (1). 

[0039] Here, r(R), g(G), b(B) each is a function 
representing an amount of each of three primitive colors 



of light: red, green and blue. When r(R) , g(G) and b(B) 
each has 8 bits, an integer from 0 to 255 is assigned to 
each of R, G and B that are parameters for the function. 
The functions r(R) , g(G) and b(B) are increased in an 
amount of red, green and blue, respectively, in proportion 
to the parameters R, G and B. When R = 0, G = 0 and B = 0, 
c(C) is black. When R = 255, G = 255 and B = 255, c(C) is 
white. 

[0040] In the case of judging whether or not the object 
is a human, it is judged whether or not an area in the 
image data such as c{C) of each pixels in the vicinity of 
the center of the image corresponds to a specific color. 
It is judged that the object is a human when c(C) is a 
color of seasan in the present embodiment since the 
present embodiment is designed for a xanthoderm. 
Conditions under which c(C) be seasan are R ^ 250, G ^ 
240 and B 150. These values or areas specified by the 
values can be varied depending on the photographing 
conditions when so required. In addition, other colors 
may be set as c(C) instead of seasan depending on the race 
of human. Plural colors may be set as c(C) on the 
condition of logical OR so that plural races may be 
recognized as human. 

[0041] In the case where the "auto" is selected in the 
detector appointing portion 4, the detector selecting 
portion 33S selects, based on the result obtained from the 
judgment by the object judging portion 32S, either one of 
the first corresponding point detector 34 which employs 
the gradient method or the second corresponding point 
detector 35 which employs the correlation method for the 
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detection of corresponding points. For example, in the 
case where the object Q is a human, the first 
corresponding point detector 34 is selected; in the case 
where the object Q is not a human, the second 
corresponding point detector 35 is selected. 
[0042] In the case where the "portrait" or the 
"landscape" is selected in the detector appointing portion 
4, the object judging portion 32S does not perform the 
judgment processing. In this case, the detector selecting 
portion 33S operates in such a manner that the first 
corresponding point detector 34 is selected if the 
"portrait" is selected and the second corresponding point 
detector 35 is selected if the "landscape" is selected. 
[0043] The first corresponding point detector 34 
detects corresponding points of plural image data DT2 by 
the gradient method. The second corresponding point 
detector 35 detects corresponding points of plural image 
data DT2 by the correlation method. Descriptions on the 
detectors will be given later in this specification. 
[0044] The three-dimensional data generator 36 
generates three-dimensional data DT3 based on the 
detection result obtained by either one of the first 
corresponding point detector 34 or the second 
corresponding point detector 35. More specifically, the 
three-dimensional data DT3 are generated by reproducing 
three-dimensional image of the object based on the 
corresponding points of the plural image data DT2. The 
three-dimensional data memory 37 memorizes the three- 
dimensional data DT3. 

[0045] The image display 38 displays a three- 




dimensional image on the display 11 based on the three- 
dimensional data DT3 • A liquid crystal display or the 
like can be used as the display 11. 

[0046] The detection of the corresponding points by the 
gradient method in the first corresponding point detector 
34 will be described below. 

[0047] Fig. 3 is an example of plural image IM2; Fig. 4 
is a graph showing an expression of luminance gradient; 
Fig. 5 is an enlarged view of windows WA and WB; Fig. 6 
shows noted pixels GA and GB as well as pixels in the 
vicinity of the noted pixels; and Fig. 7 illustrates an 
example of the method wherein a multiple resolution image 
is used. 

[0048] Images IM2a and IM2b are images that are 
represented by each of the plural image data DT2a and DT2b. 
The image data DT2a and DT2b are data generated by 
separate image input portions 21. The image IM2b 
represents a standard image and the image IM2a represents 
a reference image. The images IM2a and IM2b respectively 
include the object image QA and QB of the object Q. The 
windows WA and WB are set in each of the images IM2a and 
IM2b. The window WB is a standard window set in the 
standard image, and the window WA is a reference window 
set in the reference image. Here, the windows WA and WB 
are at a position that includes right eye of the object. 
[0049] Referring to Fig. 5, each of the windows WA and 
WB is comprised of 9 x 9 pixels . 

[0050] The windows each may be comprised of 3 x 3 
pixels, 5x5 pixels, 7x7 pixels or other numbers of 
pixels . The noted pixels GA and GB respectively are 



pixels on coordinates that are In common with the windows 
WA and WB, As shown In Fig. 6, pixels neighboring to the 
pixel GA In vertical and horizontal directions 
respectively are defined as GAl , GA2 , GA3 and GA4 . The 
same applies to the pixel GB, and the pixel GB Is adjacent 
to pixels GBl , GB2 , GB3 and GB4 . 

[0051] Purpose for detecting the corresponding points 
Is to find on the reference Image points corresponding to 
all of points on the object Image QB of the object Q In 
the standard Image. Vector connecting the corresponding 
points when the reference Image and the standard Image are 
superimposed Is referred to as a "corresponding vector" In 
this specification, and the detection of corresponding 
points using the gradient method means the detection of 
corresponding vector (u, v) with respect to points on an 
image. The corresponding vector has a point of the 
standard image as its base point and has a point on the 
reference image as its end point. 

[0052] More specifically, the corresponding vector 
represents difference between the images of the windows WB 
and WA and has relationship indicated by the following 
expression ( 2 ) . 

dE/dt = Exu + Eyv + Et = 0 . (2). 

[0053] In the expression (2), Ex = 8E/8x, Ey = 8E/8y, u 
= 8x/8t; V = 8y/8t; Et = 8E/8t; E represents ixjmlnance; x 
represents a coordinate in horizontal direction of an 
image; y represents a coordinate in vertical direction; 
and t represents time. Note that, since the images dealt 
with in the present embodiments are stereographlcal Images, 
t corresponds to a distance between two cameras 12. 
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[0054] Since the object Images QA and QB of Fig. 3 are 
prepared by photographing almost the same portion of the 
object Q, it is possible to assume that the Iviminance 
gradient of a point on a coordinate of the object image QA 
is the same as that of a point on the same coordinate of 
the object image QB* 

[0055] Since the expression (2) contains two unknown 
letters u and v while containing only one equation, it can 
represent nothing more than that corresponding vectors are 
on a linear line as shown in Fig. 4. Therefore, another 
restraining condition ±s required between u and v in order 
to determine the corresponding vectors. Thus, there is 
made such an assumption that u and v are common to all the 
pixels in a certain area, for example, in an area of 3 x 3 
pixels. Solution for u and v with respect to all the 
pixels are obtainable by solving, by the least square 
method, a simultaneous equation that is made by adding the 
restraining condition to the expression (2). It is 
possible to reduce influence of quantization error in 
imaging or noise of imaging element by using a wider area 
such as 5 X 5 pixels and 6x6 pixels . 

[0056] If the corresponding vectors with respect to all 
the pixels are the same, there can be prepared 81 types of 
the expression (2). The corresponding vectors are 
obtainable by solving the expressions by using the least 
square method. 

[0057] For example. Ex, Ey and Et of the expression (2) 
with respect to the pixels GA and GB can be represented by 
the following expressions (4) to (6). 
Ex = {([GAl] - [GA2]) 
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+ ([GBl] - [GB2])}/2 (4) 

Ey = {([GA31 - [GA4]) 

+ ([GB3] - [GB4])}/2 (5) 

Et = [GB] - [GA] (6) 

[0058] In above expressions, the reference characters 
such as [GAl] respectively represent luminance values of 
the pixels shown in Fig. 6. For example, [GAl] represents 
the luminance value of the pixel GAl shown in Fig. 6. 
[0059] Ex, Ey and Et with respect to other pixels are 
obtained in the same manner. 

[0060] However, if the corresponding vectors are 
obtained by above method, difference of images of only a 
few pixels can be precisely measured. More specifically, 
in the case where the corresponding points are different 
to each other by more than a several pixels due to a 
relatively large parallax, it is impossible to obtain a 
precise corresponding point. This is a constraint imposed 
by a premise of the gradient method and, therefore, the 
reference image and the standard image must be similar to 
each other in order to obtain the corresponding point . 
[0061] In the case where the corresponding points are 
distant from each other, the multi resolution image is 
utilized in order to detect the corresponding points 
precisely. The method using the multi resolution image it 
self is known in the art . 

[0062] As shown in Fig. 7, the multiple resolution 
image FM comprises an original image F and a plurality of 

types of images Fl, F2 and F3 obtained by so compressing 

2 2 2 

the original image F as to be (1/2) , (1/4) , (1/8) or 

the like that are low in resolution. An image that is 
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relatively low in resolution typically has smaller number 
of pixels than that of an image that is high in resolution. 
Therefore, pyramid shape is formed by arranging an image 
that is the highest in resolution at the bottom and other 
images in the descending order of resolution. Thus, the 
multiple resolution image FM forms an image pyramid. 
[0063] By the use of the multiple resolution image FM, 
two positions distant to each other between images IM2a 
and IM2b when the resolution is high, i.e., when there is 
a large number of pixels, approaches to each other when 
there is a small number of pixels. Accordingly, it is 
possible to widen an area wherein the corresponding points 
are precisely obtained by the gradient method as a result 
of transmitting the difference of images from a low 
resolution to a high resolution. 

[0064] In the case of using the multiple resolution 
image as mentioned above, the corresponding vector is 
obtained with respect to the lowest resolution image to be 
set as an initial value. Then, another corresponding 
vector is obtained with respect to the second lowest 
resolution image. By repeating above processing, 
corresponding vectors high in resolution are obtainable if 
points on the images IM2a and IM2b that represents the 
same point on the object Q are largely different to each 
other. 

[0065] The detection of corresponding points by using 
the second corresponding point detector 35 is described 
below. 

[0066] Fig. 8 shows a concept of detection using an 
epipolar line EL. Fig. 9 illustrates a relationship 
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between a correlation value and difference. 
[0067] The correlation method is a method for detecting 
windows that are high in correlation and is useful for 
calculating correlation between windows of two images . 
[0068] In the calculation of correlation, a window that 
is the lowest in the correlation value is detected first* 
The correlation value is obtainable by adding differences 
between corresponding points of the windows . In the case 
of stereographical view, it is geometrically obvious that 
the corresponding windows are on the epipolar line and, 
therefore, the detection is conducted solely with respect 
to the epipolar lixie. 

[0069] In Fig. 8, ±he window Wa is moved in the image 
IM2a from left upper end to right bottom end, and a window 
Wb corresponding the window Wa is detected from the image 
IM2b at each of the positions. The window Wb in the image 
IM2b is shifted by one pixel from the same coordinate as 
that of the image IM2a. Here, the correlation value of 
the windows Wa and Wb is calculated to detect a position 
which is the lowest in the correlation value. The 
difference obtained hereby is then obtained with respect 
to all the points moved in the image IM2a to form a map of 
differences . The differences correspond to parallax in 
the case of the stereographical view and thus called 
"parallax map". 

[0070] In Fig. 9, the difference is used to enter the 
horizontal axis and the correlation value is used to enter 
the vertical axis. In the graph plotted by solid line, 
the correlation value is the lowest when the difference is 
detected with respect to seven pixels. Accordingly, it is 
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understood from the graph that a window that is seven 
pixels off from the noted window is the most correlative 
to the noted window. Parallax obtained by above 
calculation is seven pixels . 

[0071] Thus, parallax is obtainable with respect to 
pixels in the correlation method. However, the parallax 
does not always correspond to number of pixels . In order 
to obtain parallax with respect to subpixels by the 
correlation method, the graph of solid line is 
interpolated. One of the methods for the interpolation is 
a method wherein four points close to the lowest 
correlation value are noted and a point of intersection of 
lines formed by connecting each two points of the four 
points is set as the lowest correlation value. 
[0072] By using above interpolation method, the 
parallax corresponds to 7.3 pixels as shown by the broken 
line in Fig. 9. 

[0073] Generation of three-dimensional data DT3 
according to the first embodiment will be described below 
with reference to a flowchart. 

[0074] Fig. 10 is the flowchart showing a flow of the 
generation of three-dimensional data DT3 according to the 
first embodiment. 

[0075] As shown in Fig. 10, two sets of image data DT2 
are obtained by the two image input portions 21 (#11). 
The camera 12 used for photographing the object Q is 
subjected to calibration in advance of the photographing 
in order to grasp camera parameters such as projection 
matrix and lens aberration. In the case of photographing 
a moving object, the object is shoot in succession by 
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monocular vision using one camera. 

[0076] Amounts of colors of the object Q are detected 
based on the obtained image data DT2 (#12) to judge 
whether or not the object is a human (#13). 
[0077] If it is judged that the object Q is a human, a 
corresponding point of the image data DT2a and DT2b is 
detected by the first corresponding point detector 34, i.e. 
by the gradient method (#14). If it is not judged that 
the object Q is a human, the corresponding point of the 
image data DT2a and DT2b is detected by the second 
corresponding point detector 35, i.e. by the correlation 
method (#15) . 

[0078] Then, three-dimensional data DT3 are generated 
based on either of the detection results of the step 14 or 
the step 15 (#16) . 

[0079] As described above, it is possible to generate 
highly reliable three-dimensional data by judging types of 
the object by detecting its colors, followed by selecting 
a detector, or a detection method, for detecting a 
corresponding point based on the judgment result . 
[0080] In addition, it is possible to judge whether or 
not the object is a human based on any colors. Further, 
if the color r(R) solely is extracted from pixels of the 
object Q, characteristics of eyes, nose, forehead or 
cheeks are represented by red color gradation. It is 
possible to judge whether or not the object Q is a human 
by the red color gradation. The judgment may be made 
based on other various information such as infrared rays 
that radiates from the object by other various methods. 
(Second Embodiment) 
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[0081] In the first embodiment, the selection of 
detector for generating the three-dimensional data is 
performed by detecting colors of the object. In turn, the 
selection is performed by calculating luminance gradient 
in the second embodiment . 

[0082] Fig. 11 is a block diagram showing a three- 
dimensional data generating device IB according to the 
second embodiment of the present invention. 
[0083] As shown in Fig. 11, the three-dimensional data 
generating apparatus IB comprises two image input portions 
21 and a processor 3B. The image input devices are the 
same as those of the first embodiment. 
[0084] The processor SB of the second embodiment 
comprises a luminance gradient calculator 32T and a 
detector selecting portion 33T unlike the processor 3A of 
the first embodiment. The two image data memories 31, the 
first corresponding detector 34 , the second corresponding 
point detector 35, the three-dimensional data generator 36, 
the three-dimensional data memory 37 and the image display 
38 are the same as those of the first embodiment . 
Operation processing is carried out in the luminance 
gradient calculator 32T only when the "auto" is appointed 
by the detector appointing portion 4, and the detector 
selecting portion 33T executes the selection based on the 
calculation results in the same manner as the first 
embodiment. In the case where the "portrait" or the 
"landscape" is appointed by the detector appointing 
portion 4, the first corresponding point detector 34 or 
the second corresponding point detector 35 is selected 
based on the appointment in the same manner as the first 
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embodiment . 

[0085] The luminance gradient calculator 32T calculates 
a luminance gradient of an object based on image data DT2 . 
A luminance gradient Ex in the horizontal direction is 
obtainable by the expression ( 4 ) , and a luminance gradient 
in the vertical direction Ey is obtainable by the 
expression (5)» The luminance gradients Ex and Ey are 
calculated with respect to a several points of the image 
data DT2, and the luminance gradient of the object is 
calculated by, for example, obtaining a mean luminance 
gradient of the points . Two sets of image data DT2 used 
in the expressions (4) and (5) are obtained by 
photographing almost the same part and, therefore, it is 
possible to use only one of them for the calculation. 
[0086] The detector selecting portion 33T selects 
either one of the first corresponding pint detector 34 or 
the second corresponding point detector 35 based on the 
judgment result obtained by the luminance gradient 
calculator 32T. The luminance gradient calculator 32T 
performs the selection in such a manner that the first 
corresponding point detector 34 is selected when the 
luminance gradient is lower than a predetermined value 
while the second corresponding point detector 35 is 
selected when the luminance gradient is higher than the 
predetermined value. 

[0087] Processing for generating three-dimensional data 
according to the second embodiment will be described below 
with reference to a flowchart. 

[0088] Fig. 12 is the flowchart showing the processing 
for generating three-dimensional data according to the 
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second embodiment. 

[0089] As shown in Fig. 12, two sets of image data are 
obtained by two image input portions 21 in the same manner 
as the first embodiment (#21). 

[0090] A luminance gradient of an object is calculated 
based on the obtained image data (#22), and it is judged 
whether or not a result of the calculation is higher than 
a predetermined value (#23). 

[0091] In the case where the calculation result is 
higher than the predetermined value, a corresponding point 
of the two sets of image data is detected by using the 
second detector 25, i.e. by the correlation method (#24). 
In the case where the calculation result is lower than the 
predetermined value, the corresponding point of the two 
sets of image data is detected by the first detector 26, 
i.e. by the gradient method (#25). 

[0092] Then, in the same manner as the first embodiment, 
three-dimensional data is generated based on the detection 
result (#26). 

[0093] As described above, it is possible to generate 
the highly reliable three-dimensional data by calculating 
the luminance gradient of the object and selecting the 
detector for the corresponding point based on the 
calculation result according to the second embodiment - 
(Third Embodiment) 

[0094] In the first and second embodiments, generation 
of three-dimensional data is performed by using either one 
of the detectors. In turn, in the third embodiment, both 
of the detectors are used for the detection of 
corresponding point and the generation of three- 
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dlmensional data is performed by using detection result of 
higher reliability. 

[0095] Fig, 13 shows an appearance of a three- 
dimensional data generating apparatus IC according to the 
third embodiment. Fig. 14 is a block diagram showing a 
structure of the three-dimensional data generating 
apparatus IC according to the third embodiment. 
[0096] As shown in Figs. 13 and 14, the three- 
dimensional image generating apparatus IC comprises three 
image input portions 21 and a processor 3C. The image 
input portions operates in the same manner as those of the 
first embodiment . 

[0097] The processor 3C is comprised of the same 
components as those of the first embodiment; however, the 
processor 3C is provided with three image data memories 31, 
a reliability operation portion 32U, a reliability judging 
portion 33U and a three-dimensional data generator 36U. 
Functions of the first corresponding point detector 34, 
the second corresponding point detector 35, the three- 
dimensional data memory 37, the image display 38 and the 
like are realized in the same manner as the first 
embodiment . 

[0098] The reliability judging portion 33U compares a 
detection result obtained by the first corresponding point 
detector with a detection result obtained by the second 
corresponding point detector to judge which detection 
result is higher in reliability. 

[0099] The three-dimensional data generator 36U 
performs three-dimensional reproduction based on the 
detection result higher in reliability based on the 
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Judgment result of the reliability judging portion 33U. 
[0100] A method of obtaining reliability of the 
detection results will be described below with reference 
to a flowchart and drawings. 

[0101] Fig. 15 illustrates an example of the method of 
obtaining reliability of the detection results . 
[0102] The image data DT2e, DT2f and DT2g are 
respectively generated by different image input portions. 
The point Ptl is an arbitrary point included in the image 
data DT2e. The points Pt2 and Pt3 respectively are 
obtained by the first corresponding point detector and the 
second corresponding point detector as points on the image 
data DT2f and DT2g corresponding to the point Ptl. The 
point Ptl2 is a point in a three-dimensional space 
obtained based on the points Ptl and Pt2 . The point Pt3' 
is a point obtained by projecting the point Ptl2 on the 
image data DT2g. 

[0103] Fig. 16 is a flowchart showing a flow of 
processing for obtaining the reliability of detection 
results . 

[0104] In Fig. 16, there are obtained the points Pt2 
and Pt3 that are on the image data DT2f and DT2g, 
respectively, and correspond to the point Ptl on the image 
data DT2e (#91 ) . 

[0105] The point Ptl2 on a three-dimensional coordinate 
is calculated based on the points Ptl and Pt2 (#92). The 
point Ptl2 is then projected onto the image data DT2g to 
obtain Pt3' (#93). The points Pt3 and Pt3' are logically 
the same; however, in reality, the points Pt3 and Pt3' are 
different from each other due to calibration error or 
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characteristics of the camera 12. 

[0106] The difference between the point Pt3 and the 
point Pt3' is determined by comparison thereby to obtain 
reliability of the detection result (#94). Smaller the 
difference is, higher is the reliability. 
[0107] Processing for generation of the three- 
dimensional data according to the third embodiment is 
described below with reference to a flowchart, 
[0108] Fig. 17 is the flowchart showing a flow of the 
processing for generation of the three-dimensional data 
according to the third embodiment. 

[0109] As shown in Fig. 17, three sets of image data 
are obtained by the three image input portions 21 (#31). 
[0110] Detection of corresponding points are performed 
by the first corresponding point detector 34 and the 
second corresponding point detector 35 (#32), and 
reliability of the detection results are calculated (#33). 
[0111] Based on the calculation results, it is judged 
which one of the detection result obtained by the first 
corresponding point detector 34 and the detection result 
obtained by the second corresponding point detector 35 is 
higher in the reliability (#34). Thus, the detection 
result higher in the reliability is selected (#35 and #36). 
[0112] The three-dimensional data are generated based 
on the selected detection result (#37). 

[0113] As described above, since the reliability of the 
detection result obtained by the first corresponding point 
detector 34 is compared with the reliability of the 
detection result obtained by the second corresponding 
point detector 35 and the three-dimensional data are 
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generated based on the detection result higher in the 
reliability, it is possible to generate highly reliable 
three-dimensional data according to the third embodiment. 

4 

[0114] Further, the comparison of the reliability of 
the detection results is performed with respect to whole 
parts of the detection results in this embodiment; however, 
the comparison may be performed with dividing the 
detection results into partial areas, 

[0115] For example, reliability of detection result of 
the first corresponding point detector 34 and reliability 
of detection result of the second corresponding point 
detector 35 are obtained with respect to a plurality of 
partial areas, and the reliabilities are compared to each 
other with respect to each of the partial areas to select 
the higher reliability with respect to each partial areas . 
Then, detection results higher in the reliability are so 
synthesized as to complete three-dimensional reproduction, 
thereby to generate the three-dimensional data. 
[0116] Since the parts that are high in reliability are 
thus selected from the detection results obtained by both 
of the first corresponding point detector 34 and the 
second corresponding point detector 35, it is possible to 
generate three-dimensional data high in precision. 
[0117] In the embodiments described above, parts 
corresponding to a background may be deleted from the 
image data in advance of the generation of three- 
dimensional data. Processing of the deletion of 
background may be carried out by various known methods. 
The deletion of background enables to eliminate redundant 
data and thus to reduce erroneous corresponding points. 
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[0118] In the embodiments described above, the image 
input portions 21 and the processor 3 are integrally 
comprised in the three-dimensional data generating 
apparatus 1, IB and IC; however, the image input portions 
and the processor 3 may be separately comprised in the 
three-dimensional data generating apparatus. Fig. 18 is a 
block diagram showing the modification of three- 
dimensional data generating apparatus. In this 
modification, a computer such as a personal computer is 
used for the processor 3. In Fig. 18, a multiple-lens 
camera 2 has a plurality of image input portions 21 and 
output portions 22A, 22B and 22C for outputting image 
signals obtained by the image input portions . Reference 
numeral 22A denotes a connection port for transferring the 
image signals to an external device via interface such as 
USB; reference numeral 22B denotes an output port for 
transferring the image signals to an external device via 
wireless communication such as infrared communication and 
reference numeral 22C denotes a driver for recording the 
image signals to a portable recording medium such as a 
memory card. In turn, a computer 3 is provided with input 
portions 30A, 30B and 30C that correspond respectively to 
the output portions 22A, 22B and 22C. The input portion 
30A is a connection port for receiving signals from an 
external device via interface; the input portion 30B is an 
input port for receiving signals from an eternal device 
via wireless communication and the input portion 30C is a 
driver for reading data memorized in a portable recording 
medium. An image data memory 31 serves to receive the 
image signals input into the input portions and to store 



the same . 

[0119] Thus, it is possible to improve configuration of 
the multiple -lens camera by reducing functions and devices 
required for multiple- lens camera as a result of 
performing burdensome processing such as the generation of 
three-dimensional data using a computer, 
[0120] Whole or parts of structure of the three- 
dimensional data generating apparatus 1, IB or IC, number 
of components, contents of processings or orders of 
processings may be modified as required in the scope and 
spirits of the present invention. 

[0121] According to the embodiments described above, it 
is possible to generate highly reliable three-dimensional 
irrelevant to the type of an object. 

[0122] Further, according to the embodiments described 
above, it is possible to select a detector suitable for 
detecting a corresponding point of plural image data 
easily. Moreover, it is possible to select a highly 
reliable detection result of corresponding point to 
generate three-dimensional data of high precision. 



